Porting NAHUJ to CUDA
نویسنده
چکیده
This white-paper reports on an enabling effort that involves porting a legacy 2D fluid dynamics Fortran code to NVIDIA GPUs. Given the complexity of both code and underlying (custom) numerical method, the natural choice was to use NVIDIA CUDA C to achieve the best possible performance. We achieved over 4.5x speed-up on a single K20 compared to the original code executed on a dual-socket E5-2687W.
منابع مشابه
An Incremental Approach to Porting Complex Scientific Applications to GPU/CUDA
This paper proposes and describes a developed methodology to port complex scientific applications originally written in FORTRAN to the nVidia CUDA. The process was developed and validated by porting an existing FORTRAN weather and forecasting algorithm to a GPU parallel paradigm. We believe that the proposed porting methodology described can be successfully utilized in several other existing sc...
متن کاملTranformation of CPU-based Applications To Leverage on Graphics Processors using CUDA
Scientific computation requires a great amount of computing power especially in floating-point operation but a high-end multi-cores processor is currently limited in terms of floating point operation performance and parallelization. Recent technological advancement has made parallel computing technically and financially feasible using Compute Unified Device Architecture (CUDA) developed by NVID...
متن کاملGpu Acceleration of the Long-wave Rapid Radiative Transfer Model in Wrf Using Cuda Fortran
This paper presents the approach and results of porting the Long-Wave Rapid Radiative Transfer Model (RRTM) component of the Weather Research and Forecast (WRF) code to the GPU using CUDA Fortran. After a brief description of the RTTM code, considerations regarding porting the application to the GPU are discussed. Included in the porting discussion are how the data structures have been modified...
متن کاملCompiler Support for High-level GPU Programming
We design a high-level abstraction of CUDA, called hiCUDA, using compiler directives. It simplifies the tasks in porting sequential applications to NVIDIA GPUs. This paper focuses on the design and implementation of a source-to-source compiler that translates a hiCUDA program into an equivalent CUDA program, and shows that the performance of CUDA code generated by this compiler is comparable to...
متن کاملAbstraction of Programming Models Across Multi-Core and GPGPU Architectures
ion of Programming Models Across Multi-Core and GPGPU Architectures Thomas H. BEACH , Ian J. GRIMSTEAD , David W. WALKER a and Nick J. AVIS a a Cardiff School of Computer Science, Cardiff University, Wales, UK Abstract. Work in the field of application acceleration devices is showing great promise, but still remains a tool largely for computer scientists with domain knowledge, given the complex...
متن کامل